Amazon SageMaker
NinjaLLM: Fast, Scalable and Cost-effective RAG using Amazon SageMaker and AWS Trainium and Inferentia2
Tengfei Xue, Xuefeng Li, Roman Smirnov, Tahir Azim, Arash Sadrieh, Babak Pahlavan
Retrieval-augmented generation (RAG) techniques are widely used today to retrieve and present information in a conversational format. This paper presents a set of enhancements to traditional RAG techniques, focusing on large language models (LLMs) fine-tuned and hosted on AWS Trainium and Inferentia2 AI chips via SageMaker. These chips are characterized by their elasticity, affordability, and efficient performance for AI compute tasks. Besides enabling deployment on these chips, this work aims to improve tool usage, add citation capabilities, and mitigate the risks of hallucinations and unsafe responses due to context bias. We benchmark our RAG system's performance on the Natural Questions and HotPotQA datasets, achieving accuracies of 62% and 59%, respectively, exceeding other models such as DBRX and Mixtral Instruct.
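The paper's own retriever and citation pipeline are not spelled out in the abstract, but the two ideas it names, retrieval plus citation markers, can be illustrated with a minimal stdlib-only sketch: a toy token-overlap retriever ranks passages, and the retrieved passages are prefixed with [n] markers the way a RAG prompt would carry them. All names here (`retrieve`, `answer_with_citations`, the toy corpus) are illustrative, not from the paper.

```python
import re

def tokenize(text):
    return re.findall(r"[a-z0-9]+", text.lower())

def retrieve(query, corpus, k=2):
    """Rank documents by token overlap with the query (toy retriever)."""
    q = set(tokenize(query))
    scored = sorted(
        enumerate(corpus),
        key=lambda pair: len(q & set(tokenize(pair[1]))),
        reverse=True,
    )
    return scored[:k]  # list of (doc_id, text), best match first

def answer_with_citations(query, corpus, k=2):
    """Build a context block whose passages carry [n] citation markers,
    as an LLM prompt would, and return the cited doc ids alongside it."""
    hits = retrieve(query, corpus, k)
    context = " ".join(f"[{i + 1}] {text}" for i, (_, text) in enumerate(hits))
    citations = [doc_id for doc_id, _ in hits]
    return context, citations

corpus = [
    "AWS Trainium is an AI chip for training deep learning models.",
    "AWS Inferentia2 is an AI chip optimized for inference workloads.",
    "Amazon S3 is an object storage service.",
]
context, cites = answer_with_citations("Which chip is used for inference?", corpus)
```

A production system would replace the overlap score with dense embeddings served from the fine-tuned LLM stack, but the citation bookkeeping works the same way: the generator only ever sees numbered passages, so its [n] references can be mapped back to source documents.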
Comparative Analysis of AWS Model Deployment Services
Amazon Web Services (AWS) offers three important model deployment services for model developers: SageMaker, Lambda, and Elastic Container Service (ECS). Each has critical advantages and disadvantages that influence model developers' adoption decisions. This comparative analysis reviews the merits and drawbacks of these services. The analysis found that Lambda leads in efficiency, autoscaling, and integration during model development, whereas ECS stands out for its flexibility, scalability, and infrastructure control. ECS is better suited to managing complex container environments during model development and to addressing budget concerns; it is therefore the preferred option for model developers whose objective is complete freedom and framework flexibility with horizontal scaling, and for ensuring that performance requirements align with project goals and constraints. The service selection process considered factors including, but not limited to, load balancing and cost-effectiveness. ECS is also the better choice when model development begins from the abstract: it offers unique benefits, such as the ability to scale both horizontally and vertically, making it the preferable tool for model deployment.
- Information Technology > Security & Privacy (0.93)
- Information Technology > Services (0.69)
Comparative Analysis of Retrieval Systems in the Real World
Dmytro Mozolevskyi, Waseem AlShikh
This research paper presents a comprehensive analysis of integrating advanced language models with search and retrieval systems in the fields of information retrieval and natural language processing. The objective is to evaluate and compare various state-of-the-art methods based on their performance in terms of accuracy and efficiency. The analysis explores different combinations of technologies, including Azure Cognitive Search Retriever with GPT-4, Pinecone's Canopy framework, LangChain with Pinecone and different language models (OpenAI, Cohere), LlamaIndex with Weaviate Vector Store's hybrid search, Google's RAG implementation on Cloud Vertex AI Search, Amazon SageMaker's RAG, and a novel approach called KG-FID Retrieval. The motivation for this analysis arises from the increasing demand for robust and responsive question-answering systems in various domains. The RobustQA metric is used to evaluate the performance of these systems under diverse paraphrasing of questions. The report aims to provide insights into the strengths and weaknesses of each method, facilitating informed decisions in the deployment and development of AI-driven search and retrieval systems.
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.94)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.74)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.74)
- Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.57)
Connect Amazon EMR and RStudio on Amazon SageMaker
RStudio on Amazon SageMaker is the industry's first fully managed RStudio Workbench integrated development environment (IDE) in the cloud. You can quickly launch the familiar RStudio IDE and dial up and down the underlying compute resources without interrupting your work, making it easy to build machine learning (ML) and analytics solutions in R at scale. In conjunction with tools like RStudio on SageMaker, users are analyzing, transforming, and preparing large amounts of data as part of the data science and ML workflow. Data scientists and data engineers use Apache Spark, Hive, and Presto running on Amazon EMR for large-scale data processing. Using RStudio on SageMaker and Amazon EMR together, you can continue to use the RStudio IDE for analysis and development, while using Amazon EMR managed clusters for larger data processing.
- Information Technology (0.73)
- Retail > Online (0.40)
Amazon SageMaker built-in LightGBM now offers distributed training using Dask
Amazon SageMaker provides a suite of built-in algorithms, pre-trained models, and pre-built solution templates to help data scientists and machine learning (ML) practitioners get started on training and deploying ML models quickly. You can use these algorithms and models for both supervised and unsupervised learning. They can process various types of input data, including tabular, image, and text. Starting today, the SageMaker LightGBM algorithm offers distributed training using the Dask framework for both tabular classification and regression tasks. The supported data format can be either CSV or Parquet.
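The announcement does not describe how Dask-based distributed training works internally, but the core mechanism that data-parallel, histogram-based gradient boosting (as in LightGBM) relies on can be sketched in plain Python: each worker builds gradient histograms over its own shard of the data, and the per-bin statistics are then summed across workers before split finding. This is a conceptual illustration only, not SageMaker's or LightGBM's actual implementation; all function names and the toy shards are assumptions.

```python
from collections import defaultdict

def local_histogram(rows, n_bins=4, lo=0.0, hi=1.0):
    """Per-bin (gradient sum, row count) for one worker's shard of
    (feature_value, gradient) pairs."""
    hist = defaultdict(lambda: [0.0, 0])
    width = (hi - lo) / n_bins
    for x, grad in rows:
        b = min(int((x - lo) / width), n_bins - 1)  # clamp to last bin
        hist[b][0] += grad
        hist[b][1] += 1
    return hist

def merge_histograms(hists):
    """The all-reduce step: sum per-bin statistics across all workers,
    after which any worker can evaluate candidate splits globally."""
    merged = defaultdict(lambda: [0.0, 0])
    for h in hists:
        for b, (g, c) in h.items():
            merged[b][0] += g
            merged[b][1] += c
    return merged

# Two simulated workers, each holding a shard of (feature_value, gradient) rows.
shard_a = [(0.1, 1.0), (0.3, -0.5), (0.9, 2.0)]
shard_b = [(0.2, 0.5), (0.8, 1.5)]
merged = merge_histograms([local_histogram(shard_a), local_histogram(shard_b)])
```

Because only fixed-size histograms cross the network rather than raw rows, this pattern scales to shards far larger than any single instance's memory, which is what the Dask integration provides across SageMaker training instances.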
Connecting Amazon Redshift and RStudio on Amazon SageMaker
Last year, we announced the general availability of RStudio on Amazon SageMaker, the industry's first fully managed RStudio Workbench integrated development environment (IDE) in the cloud. You can quickly launch the familiar RStudio IDE and dial up and down the underlying compute resources without interrupting your work, making it easy to build machine learning (ML) and analytics solutions in R at scale. Many of the RStudio on SageMaker users are also users of Amazon Redshift, a fully managed, petabyte-scale, massively parallel data warehouse for data storage and analytical workloads. It makes it fast, simple, and cost-effective to analyze all your data using standard SQL and your existing business intelligence (BI) tools. The use of RStudio on SageMaker and Amazon Redshift can be helpful for efficiently performing analysis on large data sets in the cloud.
- Banking & Finance (0.50)
- Retail > Online (0.40)
Augment fraud transactions using synthetic data in Amazon SageMaker
Developing and training successful machine learning (ML) fraud models requires access to large amounts of high-quality data. Sourcing this data is challenging because available datasets are sometimes not large enough or sufficiently unbiased to usefully train the ML model and may require significant cost and time. Regulation and privacy requirements further prevent data use or sharing even within an enterprise organization. The process of authorizing the use of, and access to, sensitive data often delays or derails ML projects. Alternatively, we can tackle these challenges by generating and using synthetic data.
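One common way to generate such synthetic minority-class rows, not necessarily the method used in the SageMaker walkthrough, is SMOTE-style interpolation: new rows are sampled on the line segments between random pairs of real fraud rows. A stdlib-only sketch (function name and toy features are illustrative):

```python
import random

def smote_like(minority, n_new, seed=0):
    """Generate synthetic minority-class rows by interpolating between
    random pairs of real rows (the idea behind SMOTE-style augmentation)."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        a, b = rng.sample(minority, 2)  # pick two distinct real rows
        t = rng.random()                # interpolation factor in [0, 1)
        synthetic.append([ai + t * (bi - ai) for ai, bi in zip(a, b)])
    return synthetic

# Toy fraud rows: [transaction_amount, hour_of_day]
fraud = [[120.0, 2.0], [95.0, 3.0], [200.0, 1.0]]
augmented = fraud + smote_like(fraud, n_new=5)
```

Each synthetic feature stays inside the range spanned by the two parent rows, so the augmented set enlarges the fraud class without inventing values outside the observed distribution; production pipelines typically use richer generators (e.g., tabular GANs) for the same purpose.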
- Law (0.70)
- Law Enforcement & Public Safety > Fraud (0.51)
- Retail > Online (0.40)
- Information Technology > Security & Privacy (0.35)
Informatica data science framework connects with Amazon SageMaker - Channel Asia
Informatica has launched a cloud-based development and data science framework, called INFACore, that promises to simplify the process of composing data pipelines for building and deploying machine learning models in Amazon SageMaker Studio. Powered by Informatica's Intelligent Data Management Cloud, INFACore is described as an intelligent headless data management platform for developers, data scientists, and data engineers. Simplifying the development of complex data pipelines, INFACore can turn thousands of lines of code into a single function that can be deployed into applications using a native UI supported on Amazon SageMaker Studio, the company said. INFACore went into a beta stage in May and is now generally available. Integration between INFACore and other cloud platforms besides AWS is anticipated at some point.
GitHub - aws/sagemaker-python-sdk: A library for training and deploying machine learning models on Amazon SageMaker
SageMaker Python SDK is an open source library for training and deploying machine learning models on Amazon SageMaker. With the SDK, you can train and deploy models using popular deep learning frameworks Apache MXNet and TensorFlow. You can also train and deploy models with Amazon algorithms, which are scalable implementations of core machine learning algorithms that are optimized for SageMaker and GPU training. If you have your own algorithms built into SageMaker compatible Docker containers, you can train and host models using these as well. For detailed documentation, including the API reference, see Read the Docs.
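The train-then-deploy flow the README describes follows a consistent pattern across the SDK's framework estimators. A sketch of that pattern is below; the actual calls need AWS credentials, an IAM role, and the `sagemaker` package, so they are left as comments, with the arguments gathered in a plain dict. The bucket name, script name, and version strings are placeholders, not values from this document.

```python
# The SageMaker calls need AWS credentials and the `sagemaker` package,
# so they are shown as comments; this dict just collects the arguments
# such a call takes (all values here are illustrative placeholders).
training_config = {
    "entry_point": "train.py",         # user-supplied training script
    "instance_type": "ml.m5.xlarge",   # hardware for the training job
    "instance_count": 1,               # >1 enables distributed training
    "framework_version": "2.12",       # framework version to run
    "py_version": "py310",
}

# from sagemaker.tensorflow import TensorFlow
# estimator = TensorFlow(role=role, **training_config)
# estimator.fit({"training": "s3://my-bucket/train-data"})
# predictor = estimator.deploy(initial_instance_count=1,
#                              instance_type="ml.m5.large")
```

The same `Estimator(...)` / `.fit(...)` / `.deploy(...)` shape applies whether you use a framework estimator, an Amazon algorithm, or your own SageMaker-compatible Docker container; only the constructor and its image-related arguments change.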
HERE Workspace: The low-code platform tool for map creation now comes with machine learning from AWS
HERE Technologies today announced that HERE Workspace is expanding to give enterprises more ways to integrate spatial intelligence into their business operations, supply chains and fleets. Launched two years ago as a platform tool for building and scaling customized maps, services and experiences, HERE Workspace is offering new and improved capabilities, including a low-code environment for developing spatial intelligence and a new intuitive and predictable value-based pricing model. HERE is also pleased to announce that HERE Workspace now integrates seamlessly with Amazon SageMaker, enabling users to leverage familiar value-added machine learning tools to enhance their spatial intelligence development. "We believe that every smart enterprise will want its own private map, leveraging its own spatial data at scale," says Giovanni Lanfranchi, Chief Product & Technology Officer at HERE Technologies. "Building on our progress of the last years, we're expanding the possibilities of HERE Workspace by connecting it to Amazon SageMaker, an end-to-end machine learning solution, to deliver even greater value for customers."